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Abstract 

A Baxter algebra is a commutative algebra A that carries a generalized 
integral operator. In the first part of this paper we review past work of 
Baxter, Miller, Rota and Cartier in this area and explain more recent 
work on explicit constructions of free Baxter algebras that extended the 
constructions of Rota and Cartier. In the second part of the paper we 
will use these explicit constructions to relate Baxter algebras to Hopf 
algebras and give applications of Baxter algebras to the umbral calculus 
in combinatorics. 



Introduction 

This is a survey article on Baxter algebras, with emphasis on free Baxter algebras 
and their applications in probability theory, Hopf algebra and umbral calculus. 
This article can be read in conjunction with the excellent introductory article 
of Rota EH1 See also EH ■ 

A Baxter algebra is a commutative algebra R with a linear operator P that 
satisfies the Baxter identity 

P(x)P(y) = P(xP(y)) + P(yP(x)) + \P(xy),Vx, y e R, (0.1) 

where A is a pre-assigned constant, called the weight, from the base ring of R. 

0.1 Relation with differential algebra 

The theory of Baxter algebras is related to differential algebra just as the integral 
analysis is related to the differential analysis. 

*MSC Numbers: Primary 13A99, 47B99. Secondary 12H05, 05A40. 
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Differential algebra originated from differential equations, while the study 
of Baxter algebras originated from the algebraic study by Baxter[S] on integral 
equations arising from fluctuation theory in probability theory. Differential 
algebra has provided the motivation for some of the recent studies on Baxter 
algebras. A Baxter algebra of weight zero is an integration algebra, i.e., an 
algebra with an operator that satisfies the integration by parts formula (see 
Example 1 below). The motivation of the recent work in [1311141 ITS) is to extend 
the beautiful theory of differential algebra to integration algebras. On the 
other hand, the Baxter operators, regarded as a twisted family of integration 
operators, motivated the study of a twisted family of differential operators, 
generalizing the differential operator (when the weight is 0) and the difference 
operator (when the weight is 1). 

0.2 Some history 

In the 1950's and early 1960's, several spectacular results were obtained in the 
fluctuation theory of sums of independent random variables by Anderson pQ, 
Baxter |3J, Foata |§] and Spitzer .'52 . The most important result is Spitzer's 
identity (see Proposition ^. lbll which was applied to show that certain function- 
als of sums of independent random variables, such as the maximum and the 
number of positive partial sum, were independent of the particular distribution. 
In an important paper Baxter deduced Spitzer's identity and several other 
identities from identity HU.lfl . This identity was further studied by Wendel |34j . 
Kingman JS] and Atkinson [3]. 

Rota |27] realized the algebraic and combinatorial significance of this iden- 
tity and started a systemic study of the algebraic structure of Baxter algebras. 
Free Baxter algebras were constructed by him (230^ and Cartier [Hj. Baxter 
algebras were also applied to the study of Schur functions [231221 EH]- hyper- 
geometric functions and symmetric functions, and are closely related to several 
areas in algebra and geometry, such as quantum groups and iterated integrals, 
as well as differential algebra. The two articles by Rota |28M29| include surveys 
in this area and further references. Rota's articles helped to revive the study of 
Baxter algebras in recent years: Baxter sequences in [35) . free Baxter algebras 
in [HEOEniHT] and applications in [HT2] 

Despite the close analogy between differential algebras and Baxter algebras, 
in particular integration algebras, relatively little is known about Baxter alge- 
bras in comparison with differential algebras. It is our hope that this article will 
further promote the study of Baxter algebras and related algebraic structures. 

0.3 Outline 

After introducing notations and examples in Section^] we will focus on the con- 
struction of free Baxter algebras in Section|21 We will give three constructions of 
free Baxter algebras. We first explain Cartier's construction using brackets, fol- 
lowed by a similar construction using a generalization of shuffle products. These 
two constructions are "external" in the sense that each is a free Baxter algebra 
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obtained without reference to any other Baxter algebra. We then explain Rota's 
standard Baxter algebra which chronologically came first. Rota's construction 
is an "internal" construction, obtained as a Baxter subalgebra inside a naturally 
defined Baxter algebra whose construction traces back to Baxter jS] . 

In Section [3 we give two applications of Baxter algebras. We use free Bax- 
ter algebras to construct a new class of Hopf algebras, generalizing the classical 
divided power Hopf algebra. We then use Baxter algebras to give an interpreta- 
tion and generalization of the umbral calculus. Other applications of free Baxter 
algebras can be found in Section^ relating Baxter operators to integration and 
summation, and in Section |2 proving the famous formula of Spitzer. 

We are not able to include some other work on Baxter algebras, for example 
on Baxter sequences and the Young tableau (331 and on zero divisors and 
chain conditions in free Baxter algebras jlOHllj . We refer the interested readers 
to the original literature. 

1 Definitions, examples and basic properties 
1.1 Definitions and examples 

We will only consider rings and algebras with identity in this paper. If R is the 
ring or algebra, the identity will be denoted by 1r, or by 1 if there is no danger 
of confusion. 

Deflniton 1.1 Let C be a commutative ring. Fix a A in C. A Baxter C -algebra 
{of weight A) is a commutative C-algebra R together with a Baxter operator {of 
weight A) on R, that is, a C-linear operator P : R —> R such that 



for any x,y G R. 

Let Baxc = Baxc,A denote the category of Baxter C-algebras of weight A 
in which the morphisms are algebra homomorphisms that commute with the 
Baxter operators. 

There are many examples of Baxter algebras. 

Example 1.2 (Integration) Let R be Cont(K), the ring of continuous func- 
tions on E. For / in Cont(R), define P{f) G Cont(K) by 



P{x)P{y) = P{xP{y)) + P{yP{x)) + \P(xy), 




Then (Cont(M), P) is a Baxter algebra of weight zero. 
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Example 1.3 (Divided power algebra) This is the algebra 

R = Ce n 

n>0 

on which the multiplication is defined by 

(m+n \ 
J ^m+ni ^3 71 ^ 0. 
m / 

The operator P : R — > P, where P(e„) = e„ + i, n > 0, is a Baxter operator of 
weight zero. 

Example 1.4 (Hurwitz series) Let R be 

£TC := {(a n )|a n e C,n e N), 

the ring of Hurwitz series |17j . The addition is defined componentwise, and 
the multiplication is given by (a n )(b n ) = (c„), where c„ = Ylk=o (k) a *bn-k- 
Define 

P :HC -> ffC, P((a„)) = (a„_i), where a_i = 0. 

Then i?C is a Baxter algebra of weight which is the completion of the divided 
power algebra. 

We will return to Example 11.31 and ll . 41 in Section 12^1 

Example 1.5 (Scalar multiplication) Let R be any C-algebra. For a given 
A e C, define 

P A : R-> R,x^ -\x,\/x E R. 
Then P\ is a Baxter operator of weight A on R. 

Example 1.6 (Partial sums) This is one of the first examples of a Baxter 
algebra, introduced by Baxter Let A be any C-algebra. Let 

R = Y[ A = {(ai, a 2 , . . .)|a„ G A, n e N+}. 

with addition, multiplication and scalar product defined entry by entry. Define 
P : R — » P to be the "partial sum" operator: 

P(oi, a 2 , • • •) = A(0, oi, ai + a 2 , ai + a 2 + a 3 , . . .). 

Then P is a Baxter operator of weight A on R. 

We will return to this example in Section EH 
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Example 1.7 (Distributions [5 ) Let R be the Banach algebra of functions 

<p(t) = / e ltx dF{x) 



where F is a function such that := \dF(x)\ < oo and such that 

P(— oo) := lim F(x) exists. The addition and multiplication are defined 

x — > — oo 

pointwise. Let P(ip)(i) = J °° e ltx dF{x) + P(0) - F(-oo). Then (R,P) is a 
Baxter algebra of weight —1. 

We will come back to this example in Proposition ^. 161 

Example 1.8 This is an important example in combinatorics jHJ- Let R be 
the ring of functions / : R — > R with finite support in which the product is the 
convolution: 

(fg)(x) :=£ -y), * e K. 

Define P : P -» P by 

TOO*) = E /(*). * e R - 

y^]R,max(0,y)— a; 

Then (P, P) is a Baxter algebra of weight — 1. 
1.2 Integrations and summations 

Define a system of polynomials $ n (x) G Q[s] by the generating function 

e l — 1 ' n! 

n=0 

Then we have 

® n (x) = B n (x) - B n , 
where B n {x) is the n-th Bernoulli polynomial, defined by the generating function 

te xt ^ , J n 



and B n = B n (0) is the n-th Bernoulli number. It is well-known (see |161 Sec- 
tion 15.1]) that 



k 



M* + l) = (n+l)^r" (1.9) 



r=l 



for any integer k > 1. The following property of Baxter algebras is due to Miller 
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Proposition 1.10 Let C be a Q-algebra and let R = C[i\. Let P be a C-linear 
operator on R such that P(l) = t. 

(a) The operator P is a Baxter operator of weight if and only if P(t n ) — 

4t*" +1 - 

n+1 

(b) The operator P is a Baxter operator of weight — 1 if and only if P(t n ) = 

^$n + l (*+!)■ 

(c) The operator P is a Baxter operator of weight —1 if and only if P(t n )(k) — 

k 

r™ for every integer k > 1 . 

r=l 



Proof, (a) The only if part can be easily proved by induction. For details see 
the proof of Theorem 3 in 21. . The if part is obvious. 



(b) is Theorem 4 in 21 . 



(c) Because of |(b)| we only need to show 



k 

P(t n ) = —r^n+l (t + 1) O P(t n )(k) = Vr",Vfc > 1. 
n+1 L — ' 

r— 1 

(=>) follows from l|1.9fl . {<=) can be seen easily, say from Lemma 5]. □ 

Using this proposition and free Baxter algebras that we will construct in 
the next section, we will prove the following property of Baxter algebras in 
Section [O 



Proposition 1.11 Let C be a Q-algebra and let (R, P) be a Baxter C-algebra 
of weight A. Lett be P(l). 

(a) If A = 0, then P(t n ) = ^ T t' l+1 for alln>l. 

(b) If A = -1, then P(t n ) = + 1). 



Because of Proposition 11.101 and 11.111 a Baxter operator of weight zero is 
also called an anti-derivation or integration, and a Baxter operator of weight 
— 1 is also called a summation operator |21|. 



2 Free Baxter algebras 

Free objects are usually defined to be generated by sets. We give the following 
more general definition. 



Definiton 2.1 Let A be a C-algebra. A Baxter C-algebra (Fc(A), Pa), to- 
gether with a C-algebra homomorphism ja '■ A — > Fq {A) , is called a free Baxter 
C-algebra on A (of weight A), if, for any Baxter C-algebra (R, P) of weight A 
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and any C-algebra homomorphism ip : A — ► R, there exists a unique Baxter 
C-algebra homomorphism (p : (Fc(A),Pa) — ► (R,P) such that the diagram 




R 



commutes. 

Let X be a set. One can define a free Baxter C-algebra (Fc(X), Px) on X in 
a similar way. This Baxter algebra is naturally isomorphic to (Fc(C[Xj), Pc\x]), 
where C[X] is the polynomial algebra over C generated by the set X. 

Free Baxter algebras play a central role in the study of Baxter algebras. Even 
though the existence of free Baxter algebras follows from the general theory of 
universal algebras [20|, in order to get a good understanding of free Baxter 
algebras, it is desirable to find concrete constructions. This is in analogy to the 
important role played by the ring of polynomials in the study of commutative 
algebra. 

Free Baxter algebras on sets were first constructed by Rota [57j and Cartier jS] 
in the category of Baxter algebras with no identities (with some restrictions on 
the weight and the base ring). Two more general constructions have been ob- 
tained recently |14l 115) . The first construction is in terms of mixable shuffle 
products which generalize the well-known shuffle products of path integrals de- 
veloped by Chen p] and Ree j2HJ ■ The second construction is modified after the 
construction of Rota. 

The shuffle product construction of a free Baxter algebra has the advantage 
that its module structure and Baxter operator can be easily described. The 
construction of a free Baxter algebra as a standard Baxter algebra has the ad- 
vantage that its multiplication is very simple. There is a canonical isomorphism 
between the shuffle Baxter algebra and the standard Baxter algebra. This iso- 
morphism enables us to make use of properties of both constructions. 

2.1 Free Baxter algebras of Cartier 

We first recall the construction of free Baxter algebras by Cartier|Bl^ . The 
original construction of Cartier is for free objects on a set, in the category of 
algebras with no identity and with weight —1. We will extend his construction to 
free objects in the category of algebras with identity and with arbitrary weight. 
We will still consider free objects on sets. In order to consider free objects on a 
C-algebra A, it will be more convenient to use the tensor product notation to 
be introduced in Section 

Let X be a set. Let A be a fixed element in C. Let M be the free commutative 
semigroup with identity on X. Let X denote the set of symbols of the form 

mo • [ ], u e M, 
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and 

u a ■ [tii, ■ ■ ■ ,u m ],m > 1, u (h ui, ...,u m e M. 

Let 25(X) be the free C-module on X. Cartier gave a C-bilinear multiplication 
o c on 23 (X) by defining 

(uo ■ [ ]) o c (vo ■ [ ]) = uovq-[], 
{u -[])o c (v -[vi,...,v n ]) = (v -[vi,...,v n ])o c (u -[]) 

= u v ■ [vi, . . . ,v n ], 

and 

(uo ■ [ui, . . . ,u m \) o c (v ■ [vi, . . .,v n }) 

= \ m+n ~ k u v Q ■$k,p,Q([ui,...,u m ],[v 1 ...,v n ]). 

(k,P,Q)£S c (m,n) 

Here S c (m, n) is the set of triples (k, P, Q) in which k is an integer between 1 and 
m + n, P and Q are ordered subsets of {1, . . . , k} with the natural ordering such 
that PU Q = {1, . . ., k}, \P\ = m and \Q\ = n. For each (k,P,Q) £ S c (m,ri), 
$/s,p,q([«i, ■ ■ ■ ,u m ], [vi,...,v n ]) is the element [wi,... ,w k ] in X defined by 

!u ai if j is the a-th element in P and j ^ Q; 
Vfj, if j is the /3-th element in Q and j P; 
u a vp, if j is the a-th element in P and the /3-th element in Q. 

Define a C-linear operator P£ on 5B(X) by 

P c x {uv[]) = l.[uo], 
Py(uo« = 1- [u ,ui,...,u m ]. 

The following theorem is a modification of Theorem 1 in 6 and can be proved 
in the same way. 

Theorem 2.2 The pair (%5(X), P^) is a free Baxter algebra on X of weight A 
in the category Baxp. 

2.2 Mixable shuffle Baxter algebras 

We now describe the mixable shuffle Baxter algebras. It gives another construc- 
tion of free Baxter algebras. An advantage of this construction is that it is 
related to the well-known shuffle products. It also enables us to consider the 
free object on a C-algebra A. 

Intuitively, to form the shuffle product, one starts with two decks of cards 
and puts together all possible shuffles of the two decks. Similarly, to form the 
mixable shuffle product, one starts with two decks of charged cards, one deck 
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positively charged and the other negatively charged. When a shuffle of the two 
decks is taken, some of the adjacent pairs of cards with opposite charges are 
allowed to be merged into one card. When one puts all such "mixable shuffles" 
together, with a proper measuring of the number of pairs that have been merged, 
one gets the mixable shuffle product. We now give a precise description of the 
construction. 



2.2.1 Mixable shuffles 

For to, n G N+, define the set of (to, n)-shuffles by 

S(m,n) = \aeS m+n ^ < JT^ </ " -<^ m) ' u A . 
y ' ; [_ + a i (m + l)<cr 1 (m + 2) < . . . < a L (m + n) J 

Here S m + n is the symmetric group on to + n letters. Given an (to, n)-shuffle 
a G S(m, n), a pair of indices (k,k + l), 1 < k < to + n, is called an admissible 
pair for a if a(k) < to < a(k + 1). Denote T a for the set of admissible pairs for 
a. For a subset T of T' 7 , call the pair (a, T) a mixable (to, n) -shuffle. Let | T 
be the cardinality of T. Identify (a, T) with a if T is the empty set. Denote 

S(m, n) = {(a, T) \ a G S(m, n), T C T ff } 

for the set of mixable (m,n)- shuffles. 



Example 2.3 There are three (2, 1) shuffles: 



1 2 3 \ / 1 2 3 \ / 1 2 3 



" : " 1 1 2 :', J ■ " \ 1 3 2 ) ■ n:i \ :\ 1 2 

The pair (2, 3) is an admissible pair for <j\. The pair (1, 2) is an admissible pair 
for (72- There are no admissible pairs for 0-3. 

For A G Alg c and n > 0, let A® n be the n-th tensor power of A over C with 
the convention A m = C. For x = Xi ® . . . ® z m G A* 8 " 1 , y = yi<& . . .®y n G A®" 
and (<r, T) G S(m,n), the clement 

cr(a; ® y) = u ff(1) ® u ct(2) ® . . . ® u CT(m+n) G A^( m +"), 

where 

X k , 1 < fc < TO, 

J/fe-m> TO+l<fc<TO + n, 

is called a shuffle of a; and y; the element 

cr(a; ®y;T) = u a{1) ®u a{2) ® . . . %u a{m+n) G 
where for each pair (k, k + 1), 1 < fc < to + n, 

is called a mixable shuffle of x and y. 



Uk 
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Example 2.4 For x = X\ <8 x 2 G A® 2 and y = y% G A, there are three shuffles 
of x and y: 

x\ ® x 2 <8 2/1, Xi <8 yi <8 x 2 , yi <8 Xi <8 x 2 . 
For the mixable shuffles of x and y, we have in addition, 

xi <8 x 2 yi, xij/i ® x 2 . 

Now fix A G C. Define, for a; and j/ above, 

xo+j,= J] A |T| cr(x<8 2/;T) G ,4® fe . (2.5) 

(<j,T)eS(m,n) fe<m+n 

Example 2.6 For x, y in our previous example, 

a; o+y = xi <8 x 2 €5 i/i + Xi (8 j/i (8) x 2 + ® Xi (8) x 2 + A(xi (8 x 2 J/i + Xij/i €5 x 2 ). 

2.2.2 Shuffle Baxter algebras 

The operation o + extends to a map 

o+ : A® m x A® n -> A® fe , m, n G N 
by C-linearity. Let 

III+(yl) = HIJ(A A) = A® k = C (B A(B A® 2 © . . . . 

Extending by additivity, the binary operation o + gives a C-bilinear map 

o+ : m+(A) x ni+(A) - ni+(A) 

with the convention that 

C x A®" 1 > A® m 

is the scalar multiplication. 

Theorem 2.7 14 The mixable shuffle product o+ defines an associative, com- 
mutative binary operation on III J (A) = © fceN A® fe , making it into a C-algebra 
with the identity l c G C = A® . 

Define III c 04) = HIc(A A) = A ®c UI^A) to be the tensor product alge- 
bra. Define a C-linear endomorphism Pa on LU<7(A) by assigning 

Pa(xq <8 aci (8 ... (8 x„) = 1a <8 x ® xi (8 . . . (8 x n , 
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for all xq ® x\ ® . . . ® x n £ J 4®(™+ 1 ) and extending by additivity. Let ]a '■ A — * 
ni(7(A) be the canonical inclusion map. Call (lHc(A), Pa) the (mixable) shuffle 
Baxter C -algebra on A of weight A. 

For a given set X, we also let (lHc(X), Px) denote the shuffle Baxter C- 
algebra (III(7(C[X]), Pc\x\)i called the (mixable) shuffle Baxter C-algebra on X 
(of weight A). Let jx ■ X — > UIc(X) be the canonical inclusion map. 

Theorem 2.8 [Hj The shuffle Baxter algebra (U1 C (A), P A ), together with the 
natural embedding ja, is a free Baxter C-algebra on A of weight A. Similarly, 
(Ulc(X), Px), together with the natural embedding jx, is a free Baxter C- 
algebra on X of weight A. 

2.2.3 Relation with Cartier's construction 

The mixable shuffle product construction of free Baxter algebras is canonically 
isomorphic to Cartier's construction. Using the notations from Section PZTl wc 
define a map / : X — > Ulc(X) by 

f(u ■[}) = u ; 
f(u - [ui,...,u m ]) = u <g) ui ® . . . ® u m , 

and extend it by C-linearity to a C-linear map 

/ : <8{X) -> nic(X). 

The same argument as for Proposition 5.1 in ^] Prop. 5.1] can be used to 
prove the following 

Proposition 2.9 / is an isomorphism in Bax^. 

Let A be a C-algebra. Using the tensor product notation in the construction 
of HIc(A), one can extend Cartier's construction (Theorem 12. 2fl and Proposi- 
tion for free Baxter algebras on A. 

2.2.4 Special cases 

We next consider some special cases of the shuffle Baxter algebras. 

Case 1: A = 0. In this case, nic(^4) is the usual shuffle algebra generated by 
the C-module A. It played a central role in the work of K.T. Chen [7] on path 
integrals and is related to many areas of pure and applied mathematics. 

Case 2: X = <j> . Taking A = C, we get 

OO CO 

UI C (C) = = Cl^ n+1 \ 

n=0 n=0 
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where = l c <g> . . , eg \ c . In this case the mixable shuffle product for- 

v » ' 

(n+1) —factors 

mula (|2.5|) gives 

Proposition 2.10 For any m,n G N, 

m 

We are now ready to prove Proposition !!.!!! : By Proposition 4.2 in [T3|, the 

map 

iu c (c) -» c[x], 



^ afo - A) • ■ ■ (a; - A(n - 1)) 



, n > 0, 

n! 



is an isomorphism of C-algebras. Then Pq enables us to define a Baxter op- 
erator Q on C[x] through this isomorphism and we have Q(l) = x. Then by 
Proposition II. 1UI we have 



Q(x n ) 



rxra; n+1 , if A = 0, 

^$ n+1 (a: + l), if A= -1. 



Now let (i?, P) be any Baxter C-algebra. By the universal property of (C[x], Q) 
(= (nic(C), Pc)) stated in Theorem 12.81 there is a unique homomorphism 
ip : (C[x],Q) — > {R,P) of Baxter algebras such that <p(x) = P(l). Let t = 
P(l) = (p(Q(l)) = (p(x) and since & n+ i(x + 1) has coefficients in Q C C, we 
have 

p(n = &(Q(x n )) = $ ( -^-x n+1 ) = -^ ri+1 

\ n + 1 / n + 1 

when A = and 

p(n = $(Q(x n )) = ip (-L-$ n+1 (x + 1)) = -^—<P n+1 (t + 1) 

\n+ 1 / n + 1 

when A = — 1. This proves Proposition ll.il! 

Case 3: A = and X = cf>. Taking the "pull-back" of Cases 1 and 2, we get 
{IU C (0,O)} - {m c (X,0)\X G Sets} 

(2.11) 

{Ul c {(f>,X)\X G C} — {UI C (X,X)\X G Sets, A G C} 
Thus IIIc? (0j 0) is the divided power algebra 

III c (^,0) =^Ce fc , e„e m = f m+ ™J e m+n 
feeN 

in Example 1 1.31 
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2.2.5 Variation: Complete shuffle Baxter algebras 

We now consider the completion of HIc(A). 

Given k £ N, Fil fc ni c (^) := 0„> fe - 4 ® ( ™ +1) is a Baxter ideal of m c {A). 
Consider the infinite product of C-modules UI C (A) = ]J ken yj® (fc+1) . It con- 
tains UIc(^4) as a dense subset with respect to the topology defined by the 
filtration {Fil fc HI c (^)}- All operations of the Baxter C-algebra U1 C (A) are 
continuous with respect to this topology. Hence they extend uniquely to op- 
erations on Hlc(yl), making HIc(y4) a Baxter algebra of weight A, with the 
Baxter operator denoted by P. It is called the complete shuffle Baxter algebra 
on A. It naturally contains HIc(A) as a Baxter subalgebra and is a free object 
in the category of Baxter algebras that are complete with respect to a canonical 
filtration defined by the Baxter operator |15|. 

When A — C and A = 0, we have 

Sc(C A) = l[Ce k ^HC, 

k£N 

the ring of Hurwitz series in Example 11.41 
2.3 Standard Baxter algebras 

The standard Baxter algebra constructed by Rota in is a free object in 
the category Bax^ of Baxter algebras not necessarily having an identity. It is 
described as a Baxter subalgebra of another Baxter algebra whose construction 
goes back to Baxter [o . In Rota's construction, there are further restrictions 
that C be a field of characteristic zero, the free Baxter algebra obtained be on a 
finite set X, and the weight A be 1. By making use of shuffle Baxter algebras, we 
will show that Rota's description can be modified to yield a free Baxter algebra 
on an algebra in the category Baxc of Baxter algebras with an identity, with a 
mild restriction on the weight A. We can also provide a similar construction for 
algebras not necessarily having an identity, and for complete Baxter algebras, 
but we will not explain it here. See |15) . 

We will first present Rota's construction, modified to give free objects in the 
category of Baxter algebras with identity. We then give the general construction. 

2.3.1 The standard Baxter algebra of Rota 

For details, see [771151] . 

As before, let C be a commutative ring with an identity, and fix a A in C. 
Let X be a given set. For each x € X, let t^ be a sequence (4 > ■ ■ ■ , tn > ■ ■ ■) 
of distinct symbols i„ . We also require that the sets {<« } n and {tn^} n be 
disjoint for x\ ^ xi in X. Denote 
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and denote by 2l(X) the ring of sequences with entries in C[A], the C-algebra of 
polynomials with variables in A. Thus the addition, multiplication and scalar 
multiplication by C[X] in 2l(X) are defined componentwise. Alternatively, for 
k £ N+, denote 7^ for the sequence (S nt k)n, where 6 7li k is the Kronecker delta. 
Then we can identify a sequence (a„)„ in 21(A) with a series 

00 

2J a nln = ai7l + °272 + 

71=1 

Then the addition, multiplication and scalar multiplication by C[X] are given 
termwise. 
Define 

Px = P r x,x ■ - 21(A) 

by 

Px( a i> a 2, a 3 , . . .) = A(0, ai,ai+ a 2 ,ai + a 2 + a 3 , . . .). 

In other words, each entry of P£(a), a = (ai, 02, . . .), is A times the sum of the 
previous entries of a. If elements in 21(A) are described by series Y^n=i a nln 
given above, then we simply have 

(00 \ 00 /n— 1 \ 

n=l ) n=l / 

It is well-known (5J|57] that, for A = 1, P£ defines a Baxter operator of weight 
1 on 21(A). It follows that, for any A G C, P£ defines a Baxter operator of 
weight A on 21(A). Hence (21(A), P r x ) is in Bax c . 

Definiton 2.12 The standard Baxter algebra on A is the Baxter subalgebra 
6(A) of 21(A) generated by the sequences t^) = (t[ x \ . . . , a;|f \ . . .), 1 e A. 

An important result of Rota |271 l3"T] is 

Theorem 2.13 (6(A), P£) is a free Baxter algebra on X in the category 
Baxc. 

2.3.2 The standard Baxter algebra in general 

Given A e Alg c , we now give an alternative construction of a free Baxter 
algebra on A in the category Baxc . 

For each n E N + , denote by A® n the n-th. tensor power algebra where the 
tensor product is taken over C . Note that the multiplication on A® n here is 
different from the multiplication on A® n when it is regarded as a C-submodule 

of m c {A). 

Consider the direct limit algebra 

A = limA® n 
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where the transition map is given by 

A ®n > A ®(n+l) ^ x ^ x g ^ 

Let 21(A) be the set of sequences with entries in A. Thus we have 

oo ( oo ^ 

21(A) = Y[ ~A~ln = I a «7n, a ™ 6 3 f ' 

Define addition, multiplication and scalar multiplication on 21(A) component- 
wise, making 21(A) into a A-algebra, with the sequence (1, 1, . . .) as the identity. 
Define 

P\ = P r A>x : 21(A) - 21(A) 

by 

P A (a 1 ,a 2 , a 3 , . . .) = A(0, ax, ai + a 2 , a 1 +a 2 + a 3 , . . .). 
Then (21(A), P r A ) is in Bax c . For each a £ A, define t^ = (t[ a) ) k in 21(A) by 



a, i = k . 
1, i^k. 



Definiton 2.14 The standard Baxter algebra on A is the Baxter subalgebra 
6(A) of 21(A) generated by the sequences = {t[ a \ . . . , t ( n\ ...), a e A. 

Since IIIc(A) is a free Baxter algebra on A, the C-algebra morphism 

A^2l(A),a^( Q ) 
extends uniquely to a morphism in Baxp 

$ : III C (A) -> 21(A). 



Theorem 2.15 Assume that A G C is not a zero divisor in A. The mor- 
phism in Bax<7 

$ : ni c (A) -> 6(A) 
induced by sending a e A to = (t$°' ) , . . . , f« , . . .) is an isomorphism. 

Consequently, when A is not a zero divisor in A, (6(A), P A ) is a free Baxter 
algebra on A in the category Baxc- 



15 



2.3.3 Spitzer's identity 

As an application of the standard Baxter algebra, we recall the proof of Spitzer's 
identity by Rota Spitzer's identity is regarded as a remarkable stepping- 

stone in the theory of sums of independent random variables and motivates of 
Baxter's identity. For other proofs of Spitzer's identity, see El 031 01 EH • We 
first present an algebraic formulation. 

Proposition 2.16 3T] Let C be a Q-algebra. Let (R, P) be a Baxter C-algebra 
of weight 1. Then for b S R, we have 

oo 

exp (-P(log(l + tb)- 1 )) = t n {Pb) [n] (2.17) 

in the ring of power series R\\t]]- Here 

(Pb) [n] = P{b{P(b...(Pb)...))) 

V v ' 

n-itcration 

with the convention that (Pb) [1] = P(b) and (Pb)^ = 1. 

Proof. First let x = (xi,X2, ■ ■ •) where Xi, i > 1, are symbols and let X = {x}. 
Consider the standard Baxter algebra & (X). It is easy to verify that 

(Pa;)!"! = (Q ; e n (xi), e n (x 1 ,x 2 ), e n (x 1 ,x 2 ,x 3 ), . . .) 

where e n {x\, . . . ,x m ) is the elementary symmetric function of degree n in the 
variables x\, . . . , x m . By definition, 

P(x k ) = (0, x\, x\ + x\,x\ + x k 2 + x%, . . . ,p k (x 1: x m ), . . .), 

where p k (xi, ...,x m ) the power sum symmetric function 

of degree k in the variables x%, . . . , x m . These two classes of symmetric functions 
are related by the well-known Waring's formula |3"T] 

(00 \ 00 

-^(-l) fe t fc p fe (xi, . . .,x m )/k = ^2e n (x 1: . . .,x m )t n , V to > 1. 
k=l ) n=0 

This proves 

00 

exp(-P^(log(l + te)- 1 )) =Y,t n ( p xx) ln] - (2-18) 

n=0 

Next let (R, P) be any Baxter C-algebra and let b be any element in R. 
By the universal property of the free Baxter algebra (S, P x )i there is a unique 
Baxter algebra homomorphism dp : 6 — > R such that <p(x) = b. Since all 
the coefficients in the expansion of log(l + u) and exp(it) are rational, and 
ifi o P£ — P o dp, applying dp to l|2.18|l gives the desired equation. □ 
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We can now specialize to the original identity of Spitzer, following Baxter |S] 
and Rota Consider the Baxter algebra (R, P) in Example 1 1.71 Let {Xk} be 
a sequence of independent random variables with identical distribution function 
F(x) and characteristic function 



Let S n = X\ + . . . + X n and let M n = max(0, Si,S%, . . . , S n ). Let F n (x) — 
Prob(Af„ < x) (Prob for probability) be the distribution function of M n . We 
note that, if f(s) is the characteristic function of the random variable of X, 
then P(/)(s) is the characteristic function of the random variable max(0, X). 
Applying Proposition 12 . 161 to b = ip(s), we obtain the identity first obtained by 
Spitzer: 



We refer the reader to jsH] for the application of the standard Baxter algebra 
to the proof of some other identities, such as the Bohnenblust-Spitzer formula. 

3 Further applications of free Baxter algebras 
3.1 Overview 

Recall that the free Baxter algebra lIlc(A, A) in the special case when A = C 
and A = is the divided power algebra. The divided power algebra and its 
completion are known to be related to 

• crystalline cohomology and rings of p-adic periods in number theory, 

• shuffle products in differential geometry and topology, 

• Hopf algebra in commutative algebra, 

• Hurwitz series in differential algebra, 

• umbral calculus in combinatorics, and 

• incidence algebra in graph theory. 

By the "pull-back" diagram (|2.11|) . free (complete) Baxter algebras give a vast 
generalization of the (complete) divided power algebra, and so suggest a frame- 
work in which these connections and applications of the divided power algebra 
can be extended. We give two such connections and applications in the next 
two sections, one to Hopf algebras (Section I3.2J1 and one to the umbral calculus 
in combinatorics (Section I3.3JI . 
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3.2 Hopf algebra 

3.2.1 Definition of Hopf algebra 

We recall some basic definitions and facts. Recall that a cocommutative C- 
coalgebra is a triple (A, A, e) where A is a C-module, and A : A — > A ® A and 
e : ^4 — > C arc C-linear maps that make the following diagrams commute. 

A A® A 

I A | idlglA (3.1) 

A® ,4 ^ d 

C®A e ^- d A®,4 ^ ,4®C 

~\ tA A (3-2) 



A 



A® A A® A 



(3.3) 



where ta,a : A @> A A ® A is defined by ta.a(x ®>y) = y ®) x. 

Recall that a C-bialgebra is a quintuple (A, /i, 77, A, e) where (A,jj,,r]) is a 
C-algebra and (A, A, e) is a C-coalgebra such that fi and 77 are morphisms of 
coalgebras. 

Let (A, jU, 77, A, e) be a C-bialgebra. For C-linear maps /, g : A — > A, the 
convolution / ★ 5 of / and g is the composition of the maps 

A C-lincar endomorphism S of ^4 is called an antipode for A if 

5 * idyi — idA * 5 = 17 o e. (3.4) 
A -ffop/ algebra is a bialgebra A with an antipode 5. 

3.2.2 The main theorem 

On the Baxter algebra IUc(C, A), let fi be the canonical multiplication and let 
77 : C IIIc(C, A) be the unit map. Define a comultiplication A, a counit e 
and an antipode S by 

A = A\ : in c (CA) ^in c (CA)®III c (C,A), 

n 71 — k 

fc=0 8=0 

e = e x : UIc(CA) <7, a 

S = S X : IHc(CA) -IH C (<7,A), a n ~ (-1)" ]T ("~ 3 ) 
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The following result is proved in 

Theorem 3.5 The sextuple (Ulc(C, A), fi, 77, A, e, S) is a Hopf C-algebra. 

3.3 The umbral calculus 
3.3.1 Definition and examples 

For simplicity, we assume that C is a Q-algebra for the rest of the paper. 

The umbral calculus is the study and application of polynomial sequences of 
binomial type, i.e., polynomial sequences {p n {x) \ n £ N} in C[x] such that 

71 

Pn{x + V) = ^ ( Pk(x)Pn-k(y) 
k=0 

in C[x, y) for all n. Such a sequence behaves as if its terms are powers of x 
and has found applications in several areas of pure and applied mathematics, 
including number theory and combinatorics, since the 19th century. There are 
many well-known sequences of binomial types. 

Examples 3.6 (a) Monomials. x n . 

(b) Lower factorial polynomials. (x) n — x(x — 1) ■ • • (x — n + 1). 

(c) Exponential polynomials. <fi n (x) — X)fc=o S( n > k)x k , where S(n,k) with 
n, k > are the Stirling numbers of the second kind. 

(d) Abel polynomials. Fix a^O. A n (x) — x(x — an)" -1 . 

(e) Mittag-Leffler polynomials. M n (x) = £fc = o (J) 2 fc (x) fe . 

(f) Bessel polynomials. y n {x) = Y^k=o (n-k)\k\ (f ) k ( a solution to the Bessel 
equation x 2 y" + (2x + 2)y' + n(n + l)y = 0). 

There are also Bell polynomials, Hermite polynomials, Bernoulli polynomials, 
Euler polynomials, .... 

As useful as umbral calculus is in many areas of mathematics, the founda- 
tions of umbral calculus were not firmly established for over a hundred years. 
Vaguely speaking, the difficulty in the study is that such sequences do not ob- 
serve the algebra rules of C[x]. Rota embarked on laying down the foundation of 
umbral calculus during the same period of time as when he started the algebraic 
study of Baxter algebras. Rota's discovery is that these sequences do observe 
the algebra rules of the dual algebra (the umbral algebra), or in a fancier lan- 
guage, the coalgebra rules of C[x\. Rota's pioneer work was completed over 
the next decade by Rota and his collaborators |3U1 1251 01) . Since then, there 
have been a number of generalizations of the umbral calculus. 
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We will give a characterization of umbral calculus in terms of free Baxter 
algebras by showing that the umbral algebra is the free Baxter algebra of weight 
zero on the empty set. We also characterize the polynomial sequences studied 
in umbral calculus in terms of operations in free Baxter algebras. 

We will then use the free Baxter algebra formulation of the umbral calculus 
to give a generalization of the umbral calculus, called the X-umbral calculus for 
each constant A. The umbral calculus of Rota is the special case when A = 0. 

3.3.2 Rota's umbral algebra 

In order to describe the binomial sequences, Rota and his collaborators identify 
C[x] as the dual of the algebra C[[f]], called the umbral algebra, (algebra plus 
the duality). To identify C[[t]] with the dual of C[x], let t n = £, n G N. Then 

t m t n = t m +n, m, n e N. (3.7) 

V m / 

The C-algebra C[[f]], together with the basis {t n } is called the umbral algebra. 

We can identify C[[t}] with the dual C-module of C[x] by taking {t n } to be 
the dual basis of {x n }. In other words, tk is defined by 

t k : C[x] -> C, x n i-> 8 k ,n, k, n e N. 

Rota and his collaborators removed the mystery of sequences of binomial type 
and Sheffer sequences by showing that such sequences have a simple character- 
ization in terms of the umbral algebra. 

Let f n , n > 0, be a pseudo-basis of C[[i]]. That is, /„, n > are linearly 
independent and generate C[[t]] as a topological C-module where the topology 
on C[[t]} is defined by the filtration 

^ ={!>**}• 

A pseudo-basis f n , n > 0, of C[[t]] is called a divided power pseudo-basis if 

frnfn = Jm+n, TU, 11 > 0. 

V 771 / 

Theorem 3.8 [551125] 

(a) A polynomial sequence {p n (x)} is of binomial type if and only if it is the 
dual basis of a divided power pseudo-basis of C[[t]]. 

(b) Any divided power pseudo-basis of C[[t]] is of the form f n (t) = - 4^ for 

oo 

some f E C[[t]} with ord/ = 1 (that is, /(<) = c kt k , ci ^ 0). 

k=l 

This theorem completely determines all polynomial sequences of binomial type. 
Algorithms to determine such sequences effectively have also been developed. 
See the book by Roman (51] for details. 
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3.3.3 A-umbral calculus 

Our first observation is that, with the operator 

P : C[[t]] -» C[[i\], t n »t n+1 , 

C[[t]] becomes a Baxter algebra of weight zero, isomorphic to IHc(C, 0). More 
generally, we have 

Theorem 3.9 ^2] ^ sequence {f n (t)} in C[[t]] is a divided power pseudo-basis 
if and only if the map f n (t) f—*t n ,n>0, defines an automorphism of the Baxter 
algebra C[[t]]. 

This theorem provides a link between umbral calculus and Baxter algebra. 
This characterization of the umbral calculus in terms of Baxter algebra also 
motivates us to study a generalization of binomial type sequences. 

Definiton 3.10 A sequence {p n (x) \ n G N} of polynomials in C[x] is a se- 
quence of A-binomial type if 

n n 

p n (x + y) = A fc (") (*) Pi( x )Pn+k-i(y), Vy e C, n € N. 

k=0 i=0 1 

When A = 0, we recover the sequences of binomial type. Denote 

ex{x) = ^ — 

for the series 

A^V 

^ fc! 

k=l 

When A = 0, we get a\(x) = x. We verify that 

q := {{e x {x)T) n 

is a sequence of A-binomial type of C[[x]]. Let C<q> be the C-submodule of 
C[[x]] generated by elements in q. 

Let f(t) be a power series of order 1. Define 

, mm /(t)(/(t)-A)---(/(t)-(n-l)A) 

dn{j){t) = : , n > 0. 

n\ 

Then P : C[[t]] — » C[[t]] , d n (f) t-^> d n +i(f) defines a weight A Baxter operator 
on C[[i]]. Such a sequence is called a Baxter pseudo-basis of C[[f]]. 

Definiton 3.11 Fix a A G C. The algebra C[[i]], together with the weight A 
Baxter pseudo-basis {d n (t)} n , is called the X-umbral algebra. 
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As in the classical case, we identify C[[t}] with the dual algebra of C<q> 
by taking {d n (t)} to be the dual basis of {{e\(x)) n }. We then extend the 
classical theory of the umbral calculus to the A-umbral calculus. In particular, 
Theorem 13.81 is generalized to 

Theorem 3.12 [E] 

(a) A pseudo-basis {s n (x)} ofC[[x}] is of X-binomial type if and only if {s n (x)} 
is the dual basis of a Baxter pseudo-basis of C[[t]]. 

(b) Any Baxter pseudo-basis of C[[t]] is of the form {d n (f)} for some f(t) in 
C[[t\] of order 1. 
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